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BACKGROUND OF THE INVENTION 

1 . Field of the Invention 

The present invention relates to computer system 
architecture. More specifically, the present invention relates to 
5 retrieving performance monitor data from an I/O processor. 

2 . Background Info3nnation 

Electronic products may be thought of as those products that 
involve the controlled conduction of electrons or other charge 
carriers, especially through microprocessors. Examples of 

10 electronic products may include radios, computers, work stations, 
and servers as well as those involved in high-end networking and 
storage technology. Just about all electronic products employ one 
or more microprocessors disposed within a chip located on a 
printed circuit board- These microprocessors engage a computer 

15 operating system as well as applications. The main central 
processing unit within the chip includoG m ay include a host 
system. It jrs — may be this host system that runs the computer 
operating system and the applications. 

One type of processor within the host system jrs- may be a host 

20 processor having a host memory. Another type of processor that 
may be within the host system i&-may_be an inpur input -output 
(I/O) processor. The I/O processor or I/O Platform (lOP) irS- inay 
be a component of the host system that connects the host system 
memory to an I/O device to process I/O transactions. The I/O 

25 device may be a part of or external to the host system through at 
least one of a primary first bus and a occondary second bus . 
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Examples of I/O devices include storage devices such as a small 
computer systems interface (SCSI) controller for a disk and 
networking devices such as an Ethernet controller. 

One main function of a host system -is- may be to transmit data 
5 between the host memory and an I/O device via the I/O processor. 
Transmitted data includco m ay include application data, local area 
network (LAN) packets, and contents stored on a disk. To 
accomplish data transmission, data handling and processing units 
such as a core processor and a local memory arc m ay be included 
10 within the I/O processor. The core processor and the local memory 
arc m ay be coupled to each other through an internal bus and to a 
messaging unit and a direct memory access unit through that same 
internal bus. Ideally, the system performs within established 
parameters . 

15 To measure and monitor various system parameters that 

contribute to the overall performance of the I/O processor, a 
performance monitoring unit irg- Hnay be integrated into the I/O 
processor. Under current standards, the tasks of the performance 
monitoring unit may include compiling performance measurements on 

20 the three buses: the primary first bus; the gccondairy second bus; 
and the internal bus. The measurements of the performance 
monitoring unit can be used to refine code for improved system 
level performance. However, these measurements exist in raw, 
binary data for which no mechanism exists that gathers and 

25 compiles this raw, I/O performance monitor data into a form that 
readily is— maY_be usable by a computer programmer or operator. 
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SUMMARY OF THE INVEaJTION 

The prcGont invention rolatGO to A n embodiment includes 
retrieving performance monitor data from an I/O processor. A 
perfo3rmance monitoring driver coupled to a performance monitoring 
unit io m ay be registered as a private driver with a real time 
operating system of the I/O processor. Events within the I/O 
processor arc m ay be selected on which to gather data. The 
selected events arc m ay be sent as a message request to the real 
time operating system. The message request -is — may be translated 
into the appropriate parameters based on a set of private group 
parameters >that arc m ay be accessible by the real time operating 
system. The message request jrS- Hnay be sent as a translated 
request to the performance monitoring unit. The pieces of data 
requested by the translated request arc m ay be returned to the 
performance monitoring driver. The pieces of data then arc m ay be 
sent to a location specified in the message request. 
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BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 is a functional block diagram of an I/O processor; 
Figure 2 is a block diagram of a networking system 200; 
Figure 3 illustrates a more detailed view of a host processor 
5 and an I/O processor; and 

Figure 4 is a flow diagram of a method of operation 500 of 
the invention. 
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DETAILED DESCRIPTION OF THE INVENTION 
Figure 1 is a functional block diagram of I/O processor 10, 
An example of a known processor functional block diagram -is- ^ay be 
illustrated and described for the Intel® i960® RM/RP 

5 Microprocessor as set out in Intel Corporation, i960® RM/RN I/O 
Processor Developer's Manual, pp. 1-1 through 1-12 (1st ed. July 
1998) . The description regarding Figure 1 4s- Hnay be based on the 

Intel® 1950® RM/RP Microprocessor. 

As show in Figure 1, I/O processor 10 intcgratco m ay 

10 integrate core processor 14 into a s part of a Peripheral 

Components Interconnect (PCI) functionality so as to address the 
needs of intelligent input-output applications (''intelligent I/O" 
or "I2O" applications) . Intelligent I/O applications may be 
coupled to primary PCI bus 16 and/or secondary PCI bus 18 . 

15 Preferably both B oth PCI bus 16 and PCI bus 18 arc m ay be industry 
standard, 64-bit/32-bit , high performance, low latency system 
buses coupled together by PCI-to-PCI bridge 20. A specification 
for the PCI bus jrS— may be set forth in the document PCI Local Bus 
Specification, revision 2.1, October, 1994. This manual io m ay be 

20 prepared and maintained by the PCI Special Interest Group (PCI- 
SIG) . The PCI-SIG jrs- inay be an organization that irg — may be open 
for membership to all companies in the computer industry. 

Along with providing a connection path between the two 
independent PCI buses 16 and 18, bridge 20 provides m ay provide 

25 the ability to overcome PCI electrical loading limits by allowing 
certain bus transactions on one PCI bus to be forwarded to the 
other PCI bus. Core processor 14 -is- Hnay be indirectly connected 
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to PCI-to-PCI bridge 20. Bus interface unit 24 couploo m ay couple 
core processor 14 to internal bus 26. In turn, internal bus 26 is- 
may be coupled to primary transfer group 28 and secondary transfer 
group 30. Internal bus 26 may be a 64-bit bus. PCI-to-PCI bridge 
5 20 is m ay be coupled to primary transfer group 28 through primary 
PCI bus 16 and irS- Hnrtay be coupled to Gocondary transfer group 30 
through secondary PCI group b us 18, each path of which provides 
may provide a link to core processor 14, By communicatively 
connecting core processor 14 to bridge 20, core processor 14 gives 

10 may provide processor ^intelligence^ to bridge 20 to better 
address the needs of intelligent I/O applications coupled to 
primary PCI bus 16 and/or secondary PCI bus 18. 

PCI agents 100 may be coupled to either primary PCI bus 16 or 
secondary PCI bus 18 so as to interact with one of the transfer 

15 groups, 28 and 30. PCI agents 100 may include external PCI 

devices or a host processor, such as host processor 240. Within 
PCI agent 100 may be PCI memory having PCI address spaces. 

Internal bus 26 may be coupled to local memory 38 through 
memory controller 40. Local memory 38 includes m ay include memory 

20 systems external to I/O processor 10 that do not require external 
logic. Examples of local memory 38 may include Synchronous 
Dynamic Random Access Memoary (SDRAM), Read-Only Memory (ROM), and 
Flash memory. 

Primary transfer T ransfer group 28 preferably is m ay be 
25 composed of Address Translation Unit 32, two Direct Memory Access 
channels 34, and messaging unit 36. Secondary transfer Transfer 
group 3 0 preferably is m ay be composed of Address Translation Unit 
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42 and One DMA channel 44. 

Address Translation Unit (ATU) 32 allowa m ay allow 
transactions between the PCI address space within PCI agent (or 
''agents") 100 and address space 46 within local memory 38. 
5 Address translation may be controlled through programmable 

registers (not shown) that arc m ay be accessible from both PCI 
agent 100 and core processor 14. ATU 42 functionij m ay function 
similarly to ATU 32, but performs on secondary m ay work in 
conjunction with PCI bus 18 for PCI agents 100 coupled to 
10 secondary PCI bus 18. Dual access to registers through ATU 32 and 
ATU 42 allows m ay allow flexibility in mapping the coupled address 
spaces . 

To insure p rovide low latency and high throughput data 
transfers between PCI agents 100 and local memory 38, three 

15 separate DMA channels arc m ay be provided as shown in Figure 1. 
Two Direct Memory Access (DMA) channels 34 arc m ay be included 
with primary transfer group 28 and one DMA channel 44 is m ay be 
included with secondary transfer group 30. The three DMA channels 
may operate as a DMA controller to support chaining and unaligned 

20 data transfers. This DMA controller is only m ay be programmable 
through core processor 14 . 

The I2O Architecture Specification describes an open 
architecture for developing device drivers in a system 
environment. Conventionally, based on the I2O Architecture 

25 Specification, messaging unit (MU) 36 provides data transfer 
between PCI agents 100 coupled to primary PCI bus 16 and core 
processor 14. MU 36 can be used to send and receive messages. 
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Moreover, MU 36 intcrruptG m ay interrupt PCI bus agents 100 or 
core processor 14 when new data arrives and passes the data as 
directed. 

Core processor 14 io m ay be interfaced with internal bus 26 
5 through bus interface unit 24. Local memory 38 io m ay be coupled 
to internal bus 26 through memory controller unit 40. 
Microcontrollers 56 arc m ay be interfaced with internal bus 26 
through the series of Inter-Integrated Circuit (I^C) serial bus 50 
and I^C bus interface unit 52. Both local memory 38 and 
10 microcontrollers 56 arc m ay be external to I/O processor 10. 

Application accelerator unit 54 -ig — may be also coupled to internal 
bus 26. 

Memory controller 40 allows m ay allow direct control of local 
memory 38. Core processor 14 oporatCG m ay operate out of local 

15 memory 38 where this memory space jrS- ^ay be independent of PCI 
agents 100. Bus interface unit (BIU) 24 forwards m ay forward 
accesses to core processor 14 to internal bus 26 without 
translation. Microcontrollers 56 may perform management functions 
for the systems of I/O processor 10. Application accelerator unit 

20 (AAU) 54 QXQCutcs m ay execute data transfers to and from local 
memory 38 on behalf of core processor 14 as set out for the 1^0 
standard. 

I/O processor 10 also includes m ay further include internal 
arbitration unit 60 to serve as arbiter for the systems of 
25 internal bus 26, secondary PCI arbitration unit 62 to serve as 

arbiter for the SQcondary PCI bus 18, and performance monitoring 
unit (PMON) 64. 
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As noted above, performance monitoring unit (PMON) 64 may be 
used to compile performance measurements on the three buses: 
primary PCI bus 16, secondary PCI bus 18, and internal bus 26. 
The compiled measurements of PMON 64 can be used for performance 
5 analysis or real-time system tuning by refining code for improved 
system level performance. 

Within the I2O Architecture, data is— maY_be stored in 
parameter groups on I/O processor 10 such as in local memory 38. 
Data in local memory 3 8 may be modified or extracted by host 

10 processor 240 through a message sent by host processor 240 to I/O 
processor 10, However, the I2O Architecture does not provide any 
mechanism for gathering performance measurements compiled by PMON 
64 . The below embodiments of the invention extend the I2O 
Architecture to provide this capability. 

15 Figure 2 is a block diagram of networking system 200. 

Networking system 200 includca m ay include client 300 coupled to 
client 400 through host system 230. Host system 230 may include 
host processor 240 coupled to I/O processor 210 through host 
system or PCI bus 250. Within host system 230 jrS- Hiaay be I/O 

20 device 260 interfaced with I/O processor (lOP) 210. Network lines 
402 aro m ay be coupled to I/O device 260. 

Client 3 00 may be a computer that includGs m ay include data 
input devices such as keyboard 3 02 and mouse 304 and includcG m ay 
include visual monitor 3 06. Preferably, host system 230 

25 physically is m ay be part of client 300, but may be remote from 
client 300. For example, client 300 may be in one location and 
host system 23 0 may be in another location, but connected via 
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communication channels 308 , where channels 309 may include ouch qg 
radio signals, cabling, or the Internet, 

As one example of networking system 200, host system 230 may 
be connected to client 400 through network lines 402. Network 
5 lines 402 may be any form of communication channel over which data 
from host system 230 may be transmitted to client 400. Client 400 
may be composed of one computer or millions of computers. 

Figure 3 illustrates a more detailed view of host processor 
240 and I/O processor 210. Within host processor 240 -is- Hnaay be an 

10 Operating System Specific Module (OSM) 300. Preferably, OSM 300 
is m ay be the host process software for the IjO architecture. OSM 
300 jrs- ^ay be coupled to Real Time Operating System (RTOS) 302. 
The software of RTOS 302 abatracts m ay abstract a large portion of 
I/O processor 210 hardware from the rest of the software that runs 

15 on I/O processor 210, By permitting common commands to enable 

applications within I/O processor 210, RTOS 302 permits m ay permit 
a programmer to develop a driver for one I/O device and get the 
driver running on one or many different I/O processors with 
minimal effort. 

20 Conventionally, drivers arc m ay be used to couple devices 

external to I/O processor 210. As shown in Figure 3, Small 
Computer System interface (SCSI) device driver 312 couples m ay 
couple SCSI controller 310 to RTOS 302 as an example of a storage 
device. Networking N etwork lines 402, as coupled to Ethernet 410, 

25 -3rS- Hnaay be coupled to RTOS 302 through networking n etwork driver 
314 as an example of a networking n etworked device. 

RTOS 302 aids m ay aid in message passing between the external 
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devices and OSM 3 00. By conceptually treating PMON unit 64 as a 
device external to I/O processor 210, the invention takes 
advantage of the message passing features of RTOS 302 by coupling 
PMON unit 64 to RTOS 302 through PMON driver 320. PMON driver 320 
5 irS- Hnttav be a Device Driver Module (DDM) dedicated to performance 
(perf) monitoring resources and may be referred to as "perf DDM" . 
PMON driver 320 may operate by itself or work in conjunction with 
either a storage device or a networking device. 

The system management interface of the I2O Architecture 

10 Specification provides for developing private parameter groups. 
Private parameter groups may reside in the I/O processor memory. 
An Operating System Specific Module may be used to gather data 
from a driver. For example, private parameter groups of the 
invention preferably m ay reside in the I/O processor memory and 

15 OSM 300 may be used to gather data from PMON driver 320 of Figure 
3. 

Each parameter group may include a group number, a group 
type, a group name, a description of the group, and includes m ay 
include one or more parameters belonging to the group. A 

2 0 parameter may be identified by a field number, whether the 

parameter is m ay be readable or writable, the file size of the 
parameter, the parameter name, and a description of the parameter. 
In one embodiment of the invention, three private parameter groups 
contain containing a total of thirty one fields reside as software 

25 in memory 38 of I/O processor 210. 

Within performance monitoring Table 1, Table 2, and Table 3 
below arc m ay be a set of parameter groups that define an 
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embodiment of the invention. Table 1 jrS- Hnav be the Control Group, 
Table 2 jrS- may be the Mode Control Group and Table 3 is— maY_be the 
Mode Data Group. The contents of the parameter groups may include 
the performance monitor data and performance monitor setup 
5 information. Software running on host processor 240 can m ay be 
able to gather or modify one or more parameters stored in these 
parameter groups. 
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Table 1 Performance Monitoring Control Group 0x8 00 Ox 



Group Number 
Group Type 
Name 

Description 


OxSOOOx 
SCALAR 

PERFMON_CONTROL 

A table of all control parameters for hardware -based 
performance monitoring resources 


Field 
Idx 


(r/w) 


Field 
Size 


Parameter 
Name 


Description 












0 


r/w 


5 Bytes 


LockControl 


Bytes 0-3 arc may be the 
AuthenticationKey . 

When the LockControl field is 
read by an application and the 
Locked bit is not set, the 
initiator Target Identification 
(TID) from the UtilGetParams 
message is-^nay be saved, a unicnae 
AuthenticationKey is-nnay be 
returned in these bytes of the 
field, and the Locked bit is-may be 
tentatively changed to the set 
state. The application then has 2 
seconds to lock the performance 
monitoring resources by sending a 
UtilParamsSet message with the 
proper TID to write to the 
LockControl field with the Locked 
bit set and the proper 
AuthenticationKey in Bytes 0-3. 
When this occurs, the resources 
remain locked, if they are 
currently unlocked. Otherwise, 
after the 2 second timeout period 
runs out, the locked bit is may be 
reset and other applications can 
attempt to lock the resources . 
During the 2 second backoff period, 
other applications that read the 
LockedControl field will detect 
that the resources are already 
locked, and the AuthenticationKey 
will be set to something other than 
the proper one. 
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1 


r/w 


11 

Bytes 


Control 


Bit 0 of byte 4 is-inay be the 
Locked bit. 

For the Locked bit, 1 means 
locked and 0 means unlocked. When 
this bit is set along with a proper 
AuthenticationKey, the perfDDM 
saves the initiator TID, fields 
from the UtilParamsSet message. 
Every subsequent UtilParamsSet or 
UtilParamsGet message with an 



Table 1 cont. Performance Monitoring Control Group 0x8 00 Ox 



Group Number 
Group Type 
Name 

Description 


0x8 00 Ox 
SCALAR 

PERFMON_CONTROL 

A table of all control parameters for hardware-based 
performance monitoring resources 


Fieldl 
dx 


(r/w) 


Field 
Size 


Parameter 
Name 


Description 
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1 



1 

cont , 



r/w 



11 

Bytes 



Control 



initiator TID field equal to the 
saved TID value causes a watchdog 
timer to be zeroed. After 5 min., 
if no UtilParamsGet or 
UtilParamsSet messages to the 
perfDDM parameter groups with the 
initiator TID field equal to the 
saved value are received, then the 
locked state bit jrS- Hnay be cleared. 
This mechanism mediates resource 
contention between applications and 
DDMs under development, while 
preventing resource lockout due to 
halted applications. Default is- 
may be unlocked(O) . 

NOTE: Because of the 2 second 
timeout condition, spurious 
UtilGetParams messages that read 
the LockControl field should be 
avoided, since these reads would 
prevent other applications from 
being able to lock the performance 
monitoring resources during the 
backoff period. 

Bytes 0-3 contain the 
AuthenticationKey . 
The AuthenticationKey vcrifyg 
verifies that the configuration 
host application that is— may_^e 
attempting to altor the a lteration 
is the one that locked the 
resources. On writes, this key 
must be equal to the last 
authentication key issued by the 
perfDDM. If equal, the host 
application's UtilParamsSet message 
will be processed normally. If not 
equal to the last issued 
authentication key, then the 
UtilSetParams reply message 
indicates an error. On reads this 
field returns 0. 



Table 1 cont. Performance Monitoring Control Group OxSOOOx 
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Group Niimber 
Group Type 
Name 

Description 


0x8 00 Ox 
SCALAR 

PERFMON_CONTROL 

A table of all control parameters for hardware-based 
performance monitoring resources 


Fieldl 
dx 


(r/w) 


Field 
Size 


Parameter 
Name 


Description 












1 

cont. 


r/w 


11 
Bytes 


Control 


Bit 0 of byte 4 is-^ay be the 
Accumulate count bit. 

For the Accumulate count bit, 
0 means only report deltas for last 
interval and 1 means acciamulate 
counters and time intervals . 
Default is-^ay be Accumulate 
counters and time intervals ( 1) . 

Bit 1 of byte 4 is-^nay be the 
Count ing_On bit. 

The Count ing_On bit gives the 
application a quick way to turn off 
counting in the performance 
monitor. This will be used to 
limit the impact of performance 
monitoring-related bus accesses on 
the counter statistics reported. 
When an application wants to turn 
counting off, it will send the 
smallest UtilParamsSet message 
possible to access only this field. 
When counting is turned ON, the 
saved counter values for each mode 
aro may be zeroed out. Default is- 
may be counting off. 

Note: when Counting_On is 1, all 
writes to any other parameter group 
fields or bits than Counting_ON et^eer 
may be disallowed. An error will 
be returned for such accesses. 

Note: when Counting_On is 0 and 
all the Modellnterval fields for 
all modes arc may be set to 0 (see 
group 0x8001, field #1 below), 
counting cannot be turned on, since 
no modes 



Table 1 cont. Performance Monitoring Control Group 0x8 00 Ox 
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Group Number 
Group Type 
Name 

Description 


OxSOOOx 
SCALAR 

PERFMON_CONTROL 

A table of all control parameters for hardware -based 
performance monitoring resources 


Fieldl 
dx 


(r/w) 


Field 
Size 


Parameter 
Name 


Description 


1 

cont . 


r/w 


11 
Bytes 


Control 


have been allocated counting 
intervals. Attempts to set 
Counting_On in this case will 
result in an 

12 0_PARAMS_STATUS_SCALAR_ERROR 
being returned by the operation. 

Bit 2 of byte 4 is— may be the 
GlobalSendOnCycleEnd bit. 

If bit 2 is set, the EventData 
fields of Event Acknowledge 
Messages for the CYCLE_END event 
will contain selected rows of the 
PERFMON_MODE_DATA table group. If 
this bit is reset. Event 
Acknowledge messages contain no 
EventData. Default is may be 
reset . 

Bit 3 of byte 4 is may be the Pause 
bit. 

The Pause bit can be used to 
temporarily disable counting and 
subsequently reenable counting 
without zeroing the saved counter 
values for each mode. Default is- 
may be OFF. 

Byte 5 is may be the CurrAlgorithm 

The CurrAlgorithm is may be 
used to set the sampling algorithm. 
Initially, there will be two 
sampling algorithms supported: 
User-configurable simple round- 
robin(O) , and distributed round- 
robin (1) . Default: distributed 
round- robin. 

Bit 2 of byte 4 is— may be the 
GlobalSendOnCycleEnd bit. 
If bit 2 is set, the EventData 
fields of Event Acknowledge 
Messages for the CYCLE_END event 
will contain selected rows of the 
PERFMON_MODE_DATA table group. If 
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Table 1 cont. Performanc Monitoring Control Group 0x8 00 Ox 



Group 
Number 
Group Type 
Name 

Description 


OxSOOOx 
SCALAR 

PERFMON_CONTROL 

A table of all control parameters for hardware-based 
performance monitoring resources 


Fieldl 
dx 


(r/ 
w) 


Field 
Size 


Parameter 
Name 


Description 


1 

cont . 


r/w 


11 Bytes 


Control 


this bit is-^ay be reset. Event 
Acknowledge messages contain no 
EventData. Default is-may be 
reset . 

Bit 3 of byte 4 is— may be the Pause 
bit. 

The Pause bit can be used to 
temporarily disable counting and 
subsequently reenable counting 
without zeroing the saved counter 
values for each mode. Default is- 
may be OFF. 

Byte 5 is-^ay be the CurrAlqorithm 

The CurrAlgorithm is may be 
used to set the sampling algorithm. 
Initially, there will be two 
sampling algorithms supported: 
User-configurable simple round- 
robin (0), and distributed round- 
robin (1) . Default: distributed 
round-robin. 

Byte 6 is-nnay be the SampleUnits . 

The SampleUnits specifies the 
units for the selected sample 
interval: usec(O), msec(l), 
sec ( 2 ) , min { 3 ) . Default : msec . 

Bytes 7 through 10 arc may be 
the Samplelnterval; which specifies 
the number of SampleUnits in the 
selected sample interval. Default: 
100, giving a default sample 
interval of 100 msec. If the users 
select as a sample interval less 
than the minimum sample interval, 
the actual sample interval is may 
be rounded up. The largest sample 
interval supported io may be 71 
minutes . 
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5 Table 1 cont. P rformance Monitoring Control Group 0x8 00 Ox 



Group Number 
Group Type 
Name 

Description 


OxSOOOx 
SCALAR 

PERFMON_CONTROL 

A table of all control parameters for hardware-based 
performance monitoring resources 


Fieldl 


(r/w) 


Field 
Size 


Parameter 
Name 


Description 


2 


r 


1 Byte 


MaxMode 


Maximum # of modes for the 
performance monitoring resources . 
DDMs use this to indicate the 
number of modes supported by the 
underlying hardware. Typically 
these arc may be hardware modes. 


3 


r 


1 Byte 


CurrentMode 


Current mode of the performance 
monitoring resources. This is-^ay 
be used for debugging purposes. 


4 


r 


1 Byte 


MaxAlgorith 
m 


Maximum # of sample algorithms. 
DDMs can use this to indicate the 
niomber of algorithms that can be 
supported. Sample algorithms 
may be software-controlled. 


5 


r 


1 Byte 


MinSampleUn 
its 


Specifies the units for the minimum 
supported sample interval : 
usec(O), msec(l), sec (2), min(3). 
This value will be determined by 
the resolution of the RTOS event 
timer. 


6 


r 


4 

Bytes 


MinSampleIn 
terval 


Number of MinSampleUnits in the 
minimum supported sample interval . 
This value will be determined by 
the resolution of the RTOS system 
timer rounded to the nearest 
microsecond. 


7 


r 


1 Byte 


PerfHWType 


Type of performance monitoring 
hardware available: -0 means NONE, 
1 means i960 RN or RM, all other 
values arc may be reserved. 


11 


r 


1 Byte 


NumCounters 


Number of performance monitor 
counters supported by hardware. 


13 


r 


1 Byte 


Samplelnter 
val 

Status 


Provides an indicator of whether 
the user-selected sample interval 
is-may be okay(O), too small (1), or 
too large (2) , adjusted{3) . 
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14 



r 



4 

Bytes 



AdjustedSam 
pie 



Displays the rounded up sample 
interval, e.g. the user-configured 
sample interval rounded up to the 
next integer multiple of the system 
timer resolution . 



Interval 



Notco for Table 1 : The group numbcro for the porformancc 

monitoring private parameter groups preferably will bo determined 
by the 1^0 SIG aoaigncc of the inventor. — Moreover , — the 
AuthcnticationKcy may be baocd on a global timer to guarantee 

uniqucncGG . In addition, — clearing the locked atage bit after five 

minutes preferably will not prevent rcoourco contention between 
multiple applications on the same host since meooagco from thecc 

all have the same initiator TIB value. Originally, — it was thought 

that the initiator context field could be used for this purpose. 
However, — some hoot operating systems control this field. 
Applications have no control over this field. 
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Table 2 Performanc Monitoring Mode Control Group 0x8001 



Group 
Number 
Group Type 
Name 

Description 


0x8001 
Table 

PERFMON_MODE_CONTROL 

A table of mode-specific control parameters for the 
performance monitoring resources. 


Fieldl 
dx 


(r/ 
w) 


Field 
Size 


Parameter 
Name 


Description 












0 


r 


1 Byte 


Mode 


Performance monitoring mode to be 
controlled, 1-7. 


1 


r/w 


8 Bytes 


ModeControl 


Bytes 0-3 contain the 
AuthenticationKey 

The AuthenticationKey is— may 
be used to verify that the host 
application which io attempting to 
alter the configuration is the one 
that locked the resources. On 
writes, this key must be equal to 
the last authentication key issued 
by the perfDDM. If equal, the host 
application's UtilParamsSet message 
will be processed normally. If not 
equal to the last issued 
authentication key, then the 
UtilSetParams reply message 
indicates an error. On reads, 
bytes 3-7 return 0. 

Bytes 4 - 7 of this field contain 
the Modelntervals . 

The Modelntervals represents 
the number of selected sample 
intervals dedicated to a particular 
mode when using the simple round- 
robin; writes to this sub-field 
cause all modes counter values to 
be zeroed, if not already done. A 
flag, count s_zeroed, in perfDDM 
will track this; it will may be set 
when the first Mode Time is zeroed, 
and reset when counting is turned 
on. This will reduce the impact on 
the overall system that would occur 
if the Mode Times for more than one 
mode were altered. This defaults 
to 10 times the sampling interval. 



Noto for Tablo 2 ! — No rowDcloto or rowAdd oporationo ncod bo 
oupportcd by porfDDM oinoo thono ficldo aro prooont for all modoa. 
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Table 3 P rformance Monitoring Mode Data Group 0x8002 



Group Number 
Group Type 
Name 

Description 


0x8002 
Table 

PERFMON_MODE_DATA 

A table of data parameters for all counters in each 
mode , Each row pertains to an individual mode . 


Fieldl 
dx 


(r/w) 


Field 
Size 


Parameter 
Name 


Description 












0 


r 


1 Byte 


Mode 


Performance monitoring mode for the 
data, 1-7, 


1 


r 


8 

Bytes 


CurrTime 


Current accumulated value of GTSR 
register plus rollover bits. 

This value accumulates over 
multiple sample intervals until the 
end of the timcolico io time slice 
may be reached. This is-^ay be 
updated whenever an interval ends, 
if the host application disables 
counting in the middle of an 
interval, or if the host 
application requests the data in 
the middle of an interval . Main 
use of this -is— may be to detez*mine 
how stale or current the data is- 
may be when very lonQ interval 
times arc may be being used. Units 
arc may be the period of the GTSR 
clock. 


2 


r 


8 

Bytes 


SigmaTime 


Accumulated time for this mode at 
end of last completed interval . 
Units arc may be the period of the 
GTSR clock. 


3 


r 


8 

Bytes 


EndingTime 


Value of GTSR plus rollover bits at 
the end of the last completed 
interval . 

This is— may be compared to 
CurrTime to determine the relative 
currency of the counter data . 


4 


r 


8 

Bytes 


Counter 01 


PECOl counter value at end of last 
completed interval. 


5 


r 


8 

Bytes 


Counter02 


PEC02 counter value at end of last 
completed interval . 


6 


r 


8 

Bytes 


CounterOB 


PECOS counter value at end of last 
completed interval , 


7 


r 


8 

Bytes 


Counter 04 


PEC04 counter value at end of last 
completed interval . 


8 


r 


8 

Bytes 


Counter 05 


PECOS counter value at end of last 
completed interval . 


9 


r 


8 

Bytes 


Counter 06 


PECO 6 counter value at end of last 
completed interval . 
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10 


r 


8 

Bytes 


CounterOV 


PEC07 counter value at end of last 
completed interval. 


Table 3 cont. Perfonoance Monitoring Mode Data Group 0x8002 


Group Number 
Group Type 
Name 

Description 


0x8002 
Table 

PERFM0N_MODE_DATA 

A table of data parameters for all counters in each 
mode. Each row pertains to an individual mode. 


Fieldl 
ox 


(r/w) 


Field 

oize 


Parameter 
Name 


Description 












11 


r 


8 

Bytes 


CounterOS 


PEC08 counter value at end of last 
completed interval. 


12 


r 


8 

Bytes 


Counter 09 


PEC09 counter value at end of last 
completed interval. 


13 


r 


8 

Bytes 


Counter 10 


PECIO counter value at end of last 
completed interval. 


14 


r 


8 

Bytes 


Counter 11 


PECll counter value at end of last 
completed interval. 


15 


r 


8 

Bytes 


Counter 12 


PEC12 counter value at end of last 
completed interval. 


16 


r 


8 

Bytes 


Counter 13 


PEC13 counter value at end of last 
completed interval. 


17 


r 


8 

Bytes 


Counterl4 


PEC14 counter value at end of last 
completed interval. 



NotGO for Tabic 3 ! SigmaTimc and all counter valuoo arc 



initialized to zero. — Aloo, — if delta counting is acloctcd, when 
the performance monitor mode is first entered, — Sigma time and all 

5 the counter values are zorocd. In cumulative counting, — the Sig ma 

time and all of the counter values arc not zeroed when the mode io 
first entered, — and when the mode is exited the values in the GTCR 
and all of the counters arc added to the values in this parameter 
group . 
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Figure 4 is a flow diagram of a method of operation 500 of 
the invention. Method 500 may be implemented in a computer 
readable storage medium containing executable computer program 
instructions which when executed cause the networking system to 
5 perform method 500. Also, method 500 may be implemented in a 

distributed readable storage medium containing executable computer 
program instructions which when executed cause an I/O processor to 
perform method 500. 

At initialization 502 of method 500, the software in local 

10 memory 38 of I/O processor 210 initializco m ay initialize . This 
crcatQG m ay create a performance monitoring (PMON) storage table 
in local memory 38. The PMON storage table can m ay be able to 
store and keep separate the various pieces of information 
retrieved from PMON unit 64 and placed in that table. 

15 Additionally, initializing the software in local memory 38 also 
cauQCG m ay cause PMON driver 320 to register as a private driver 
with RTOS 3 02 of I/O processor 210 since PMON driver 320 io not 
may not be a networking type driver or a storage type driver. 

In operation, a user of client 300 or client 400 of Figure 2 

20 initiatOG m ay initiate a performance monitor application at step 
506 that acncratcG m ay generate a selection screen at visual 
monitor 3 06. The selection screen allowo m ay allow the user to 
select those events on primary PCI bus 16, occondary PCI bus 18, 
or internal bus 2 6 for which the user desires to compile data. 

25 The user may also specify the time periods that the user desires 

to see the compiled data at visual monitor 306, In regards to the 
events on primary PCI bus 16, occondary PCI bus 18, or internal 
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bus 26, the Architecture divides monitorable events into 
occurrence events and duration events. Occurrence events is-^nay 
be counted each time the event occurs. For a duration event, a 
counter counto m ay count the number of clocks during which a 
5 particular condition or set of conditions -js- may be true. A total 
of ninety-eight events may be monitored, subject to the 
availability of event counters. 

At step 510, the user oclocts m ay select at visual monitor 
306 those events on primary PCI bus 16, occondary PCI bus 18, or 

10 internal bus 26 for which the user desires to compile data by 

entering their selection in the selection screen. After selecting 
those events the user desired monitored, the information entered 
into the selection screen at visual monitor 306 is-^nay_be sent at 
step 514 to OSM 300 of host processor 240 as a message request_^ 

15 This message recfuest may specify that opocifios specific pieces of 
data from PMON unit 64. OSM 300 relays m ay relay this message 
request to RTOS 302 at step 518 without an understanding of 
whether the message request is for a networking type driver, a 
storage type driver, or a private driver. However, RTOS 302 docs 

20 may recognize this message request as a request for the PMON 

driver 320, previously registered as a private driver with RTOS 
302. 

Using the fields of the three private parameter groups 
residing in local memory 38 of I/O processor 210, RTOS 302 
25 tranGlatcG m ay translate the message request at step 522 into the 
appropriate parameter of the private group parameters of the 
invention. Parameters of the private group parameters arc m ay be 
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set out in Table 1, Table 2, and Table 3 above. 

At step 526, RTOS 302 oondG m ay send this translated request 
to PMON driver 320, In response to receiving the translated 
request, PMON driver 320 quoriGS m ay query PMON unit 64 at step 
5 530 for the specific pieces of data requested by the user. The 
query to PMON unit 64 rGPulto m ay result in the requested pieces 
of data being sent to PMON driver 320 at step 534, PMON driver 
320, in turn, ocndo m ay send this data to the PMON storage table 
in local memoiry 38 at step 538 where the data will m ay be compiled 

10 and stored for dispatch to the user at the time periods specified 
by the user at visual monitor 306 in the selection screen. The 
data preferably io m ay be saved in such a way that another 
application could m ay be written that would take this data and 
present it to client 300, for example, in a meaningful fashion. 

15 At the time periods specified by the user in the selection 

screen, I/O processor 210 oondG m ay send the compiled performance 
monitoring data back to the user through RTOS 302 and OSM 300 at 
step 542 . The compiled performance monitoring data may also be 
directed to other locations within networking system 200 for 

20 purposes such as to cause an effect. For example, the data may be 
sent to an interpreting device that determines whether the server 
of host system 230 is too hot based on the compiled performance 
monitoring data. If so, the interpreting device may generate a 
message that causes an internal fan to turn on. 

25 The exemplary embodiments described herein are provided 

merely to illustrate the principles of the invention and should 
not be construed as limiting the scope of the subject matter of 
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the terms of the claimed invention. The principles of the 
invention may be applied toward a wide range of systems to achieve 
the advantages described herein and to achieve other advantages or 
to satisfy other objectives, as well. 
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